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DIFFUSION  APPROXIMATIONS  TO  OUTPUT  PROCESSES  OF  NON-LINEAR  SYSTEMS 
WITH  WIDE  BAND  INPUTS,  AND  APPLICATIONS 

it  it  it 

Harold  J.  Kushner  ' 

Abstract 

Many  problems  in  communication  theory  involve  approximations 
of  a  Markov  type  to  outputs  of  non-linear  (feedback  or  not)  systems, 
often  so  that  Fokker-Planck  techniques  can  be  used.  A  general 
and  powerful  method  is  presented  for  getting  diffusion  approxima¬ 
tions  to  outputs  of  systems  with  wide  band  inputs.  The  input  is 
parameterized  by  e  and  as  £  +  0  the  band  width  goes  to  00  (loosely 
speaking) .  It  is  proved,  under  reasonable  conditions  on  the 
systems  and  noise,  that  the  sequence  of  system  output  processes 
converges  weakly  to  a  Markov  diffusion  process,  which  is  charac¬ 
terized  completely.  Many  communication  systems  fit  the  model  of 
the  paper  and,  in  order  to  make  mathematical  sense  out  of  many 
common  developments  of  system  properties,  assumptions  such  as 
those  of  this  paper  are  often  required.  The  usefulness  and 
relative  ease  of  use  of  the  method  is  illustrated  by  application 
to  three  examples:  (a)  phase  locked  loop,  where  a  Markov  diffu- 
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sion  approximation  of  the  error  process  is  developed,  (b)  adaptive 
antenna  system,  where  an  asymptotic  analysis  of  the  equations  for 
the  system  is  given,  (c)  diffusion  approximation  to  the  output  of 
a  hard  limiter  followed  by  a  band  pass  filter;  input-output  S/N 
ratios  are  developed  (a  version  of  a  classical  problem  of  Daven¬ 
port)  .  Difficulties  with  the  usual  heuristic  approaches  to  (a) , 

(b)  are  discussed.  The  method  is  versatile  and  the  models  quite 
general.  Since  weak  convergence  methods  are  used,  the  approximate 
"limits"  yield  approximations  to  many  types  of  functionals  of  the 
actual  systems. 
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1.  Introduction 

Many  problems  in  communication  theory  involve  representations 
of  (or  approximations  to)  outputs  of  devices  (linear,  nonlinear, 
feedback)  whose  inputs  are  signals  added  to  a  relatively  wide 
band  noise;  e.g.,  phase  locked  loops  (PLL) ,  adaptive  antenna 
arrays  or  automatic  gain  controls.  Normally,  the  development 
of  the  output  representation  (or  approximation)  requires  special 
assumptions  (e.g.,  sinusoidal  inputs,  Gaussian  noise),  and 
various  heuristic  arguments  are  usually  needed  to  approximate 
the  output  by  Markov  diffusion  processes  whose  Fokker-Planck 
equation  is  to  be  analyzed  in  order  to  get  some  sort  of  approx¬ 
imation  to  the  statistics  of  the  true  output  process. 

In  this  paper,  a  rather  powerful  method  is  presented  for 
getting  either  the  usual  or  related  approximations,  under 
assumptions  which  are  reasonable,  explicit,  and  often  weaker 
than  the  usual  ones.  The  relative  ease  of  use  of  the  method 
is  illustrated  here  by  applications  to  three  rather  different 
problems:  (a)  the  PLL,  (b)  an  adaptive  antenna  array,  (c)  a 


version  of  Davenport's  [i  ]  result  on  the  output  of  a  band 
limiter  followed  by  a  zonal  filter. 

In  particular,  denote  the  input  noise  by  n e  C  -  )  where  as  £  -► 


0  the  bandwidth  (BW) 


Under  conditions  to  be  imposed,  the  se¬ 


quence  of  outputs  (with  input  signal  s(-)  plus  noise  ne(-))  will  con-  ■; 
verge  to  a  process  whose  state  variable  representation  is  a 
Markov  diffusion  process,  and  we  will  readily  be  able  to  find 

that  process.  Implf^it^li*  br',6Jqplieitl!y 'l(«olS-wie  indiic<Ltffficin  the 
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examples  below)  many  of  the  current  heuristic  arguments  use 
a  similar  assumption  on  the  noise  -  at  least,  the  "output 
approximation"  may  not  make  sense  unless  it  is  viewed  as  the 
limit  of  a  sequence  of  outputs  in  our  sense. 

Our  method  has  the  advantage  that  the  assumptions  are 
clearly  seen,  it  is  applicable  to  a  great  variety  of  situations, 
and  the  terms  which  a  more  heuristic  analysis  would  drop  can  be 
clearly  seen.  The  limit  is  in  the  sense  of  weak  convergence  of 
probability  measures  [  2 ] .  Thus  the  distributions  of  a  great 
variety  of  functionals  of  the  sequence  of  outputs  (with  para¬ 
meter  e)  converge  to  that  of  the  limit.  Furthermore,  under  cer¬ 
tain  circumstances  additional  information  on  approximations  to 
stationary  measures  can  be  obtained.  Also,  nonstationary  inputs 
can  be  accommodated.  The  fact  that  y(*)  can  occur  nonlinearly 
in  (2.1)  -  (2.3)  below  is  important  in  the  applications  which 
involve  some  nonlinear  processing. 

In  the  communications  literature,  the  problem  of  obtaining 
the  (Markov-dif fusion)  limit  of  the  sequence  of  outputs  of  a 
system  as  the  input  noise  BW  tends  to  ®  was  perhaps  initiated  by 
Wong  and  Zakai  [3],  [4]  in  a  very  special  case.  Later  cases 
were  treated  by  Khazminskii  [5],  Papanicolaou  and  Kohler  [6], 
Papanicolaou  and  Blankenship  [  7 ]  and  Kushner  [  8 ] ,  [  9 ] .  The 
treatment  here,  based  on  a  semigroup  approximation  of  Kurtz  [10], 
was  developed  in  [  8 ] ,  [  9 ]  to  get  limit  theorems  of  the  desired 
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In  Section  2,  the  basic  model  is  discussed,  together  with 
the  general  scheme  of  Kurtz  [10] ,  and  the  main  approximation 
theorems  appear  in  Section  3.  Sections  4,  5  and  6  deal  with  the 
three  problem  classes  (a),  (b)  and  (c)  mentioned  above.  The 
theory  is  developed  first  for  the  canonical  models  (2.1)— (2.3) . 
Often  in  applications,  such  as  those  in  Sections  4-6,  the  models 
are  a  little  different.  But,  as  we  will  see,  the  development 
give  for  the  canonical  model  tells  us  exactly  how  to  proceed  in 
the  other  cases.  In  a  project  currently  under  way,  the  method 
is  used  to  study  a  class  of  PLL's  with  non-linear  filters  (which 
seems  to  have  certain  advantages),  a  problem  which  has  not  been 
treated  and  for  which  there  seems  to  be  no  other  "natural"  method 
at  present. 


2.  The  Basic  Model 

Noise  model.  First,  we  derive  the  noise  model.  In  order 
to  conveniently  get  a  class  of  processes  nG(*)  whose  BW  goes  to  ® 
and  energy/unit  BW  converges  to  a  constant  /  0  as  e  +  0,  we 
work  with  ne(-)  of  the  form  n£ (t)  =  yz{t)/z,  where  ye (t)  =  y(t/e^), 
and  y  is  a  stationary  process.  Other  forms  are  possible.  In 
particular,  see  [9],  where  ne(*)  is  built  up  from  a  sequence 
of  small  correlated  effects,  each  of  whose  "size"  -*■  0  and  the 
number  of  which  (in  any  finite  interval)  goes  to  °°  as  e  -*•  0. 

Other  forms  are  possible  -  and  yield  rather  similar  results. 

One  way  or  another,  an  explicit  model  for  nG(*)  must  be  given 
which  allows  BW  +  =»  as  e  +  0.  The  selected  model  is  one  useful 


choice.  As  seen  below,  it  has  the  desired  properties.  The 
method  to  be  developed  can  handle  many  other  useful  noise  models 
as  well.  Basically,  the  noise  must  be  parametrized  in  such  a  way 
that  Theorem  i  can  be  adapted  to  the  problem. 

Suppose  that  y(*)  has  a  spectral  distribution  S(w).  Then 

that  of  nE(-)  is  S(e^w)  =  (w) .  See  Fig.  1.  Note  that  the 

2 

t/e  scaling  spreads  the  BW  (and  the  "center"  frequency  in  the 
band  if  there  is  one  (as  in  Fig.  lb))  and  the  1/ t  scale  keeps 
the  energy  per  unit  BW  from  degenerating.  Without  the  1/c 
factor  the  energy  per  unit  BW  goes  to  zero  as  e  -»•  0 ,  and  all 
the  limits  are  "noiseless".  We  do  not  require  that  y(-)  has  a 
spectral  distribution.  The  above  remark  is  for  motivation  only. 

System  models.  There  are  several  canonical  forms  with  which 
we  can  work.  The  system  outputs  can  be  representable  (state 
variable  form)  by  one  of  the  related  ODEs  (ordinary  differential 
equations) • 

(2.1)  *£  =  G(xc,ye,t)  +  F(xe,ye,t)A, 

(2.2)  xC  =  G£(xc,yc,t)  +  F£  (xc,ye,t)/e, 

(2.3)  x£  =  Gc(xe,t)  +  F£(xe,y  e,t)/e,  xc  (0)  G  Rr  , 

Euclidean  r-space. 


In  order  to  simplify  the  development,  we  deal  mostly  with 
(2.1),  and  then  show  how  to  extend  the  result.  As  will  be  clearly 
seen  in  Sections  4-6,  the  forms  (2.1)- (2.3)  cover  many  common 
cases,  and  the  method  to  be  described  can  readily  be  extended  to 
many  other  cases.  The  t  argument  accounts  for  the  presence  of  the 
signal  or  other  non-stationarities  (see  Sections  4-6) . 

r  r 

Weak  convergence.  Let  D  [0,°°)  be  the  space  of  R  -valued 
functions  on  [0,°°)  which  are  right  continuous,  have  left-hand 
limits  and  let  the  space  have  the  Skorokhod  topology  ( [  2  1 » 

Section  14  ) ,  as  is  usual  in  studies  of  weak  convergence.  Each 
process  xe  ( • )  has  paths  in  Dr[0,oo).  in  fact  the  paths  are  con¬ 
tinuous.  Let  x  ( • )  be  a  process  whose  paths  are  also  in  Dr[0,°°). 

£  D 

We  say  that  x  (•)  ■+■  x(-)  (weak*  convergence  of  the  corresponding 
measures)  iff  for  each  bounded  continuous  real-valued  f(-)  on 
Dr[0,»),  Ef (xG ( • ) )  -►  Ef  (x  ( •  )  )  .  This  is  a  considerable  general¬ 
ization  of  convergence  in  distribution,  and  is  a  concept  which 
is  very  useful  in  many  areas  of  probability  and  statistics.  In 
addition,  if  xG  ( • )  ■*  x(*)  weakly  in  Dr[0,“)  and  f(.)  is  bounded 
measurable  and  only  almost  everywhere  continuous  with  respect  to 
x(*)  measure,  then  Ef(xG(*))  -*■  Ef(x(»))  as  e  -*■  0  also. 

A  sequence  (xG(*)}  is  tight  iff,  for  each  6  >  0,  there  is 
a  compact  set  in  Dr[0,<=°)  such  that  sup^P  {xG ( • )  C  K{}  _<  6. 

Suppose  that  x(-)  has  continuous  paths  w.p.  1.  Then  the 
two  usual  steps  in  proving  weak  convergence  are:  (i)  showing 
convergence  of  finite-dimensional  distribution  of  (xG(*)}  to 
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those  of  x(*);  ( i i )  showing  tightness  of  {x'  (•)}.  These  imply 

the  weak  convergence.  The  theorems  given  below  do  all  this 
efficiently.  Our  limit  process  x(-)  will  be  a  Markov  diffusion, 
and  the  theorems  below  allow  us  to  calculate  its  infinitesimal 
operator.  For  more  detail  on  weak  convergence  see  [2], 

An  example  of  the  limit  operator.  Let  denote  the 

y~ 

set  of  bounded  continuous  functions  on  [0,°°)  x  R  with  compact 

A(^  o 

support,  Sf  '  the  subset  with  continuous  a-partial  t-deri- 

A 

vatives  and  6-partial  x-derivatives ,  and  let  denote  the 
closure  of  ^  under  uniform  convergence.  Under  conditions  to 
be  imposed  (including  EF(x,y(s),t)  =  0  for  all  x,  t) ,  and  with 
the  model  (2.1),  the  infinitesimal  operator  (3/3t  +  A)  of  the 
limit  process  x(‘)  is  (acting  on 


(2.4)  (3/at  +  A)  f  (x ,  t )  =  f  t  ( x ,  t )  +  Ef^(x,t)G(x,y (0)  ,t) 


EF' (x,y (0) , t) (F' (x,y (s) , t) f  (x, t) )  ds 

X  X 


=  l  b .  (x, t) f  ( x , t )  +  4  l  a .  .  (x, t ) f  (X,t), 

i  i  i » j  J  i  j 


where  b(*,*)  and  a(*,-)  =  {a^j  (•*•)}  are  defined  in  the  obvious 
manner,  and  we  assume  that  a(*,«)  is  symmetrized  to  conform  with 
the  usual  form  of  the  operator  A.  Define  G(x,t)  *  EG (x,y (0 ) , t) . 


-/o 


If  there  is  a  matrix  o(-,*)  such  that  a ( -  ,  -  )  = 
a  (•/•) o (*,•)' /2 ,  then  there  is  a  standard  vector-valued  Wiener 
process  B(-)  such  that  x(*)  has  the  Ito  equation  representation 

(2.5)  dx  =  b(x,t)dt  +  a(x,t)dB. 

This  is  the  case  in  the  examples.  Note  that  b(x,t)  contains  two 

components  -  the  first  is  G(x,t)  and  the  second  is 
00 

JEF' (x,y (s) , t) F (x,y (0) , t) ds ,  where  we  define 
0 


The  last  term  arises  for  the  same  reasons  that  cause  the  Wong- 
Zakai  [3],  [4]  correction  term;  i.e.  the  interaction  between 

xe(t)  and  ne(t).  As  it  turns  out,  the  typical  heuristic  argu¬ 
ments  used  to  deal  with  problems  (a),  (b)  obtain  "limits" 

without  the  "correction  term". 

Some  definitions.  Let  denote  expectation  conditioned 
on  nE(s),  s  <_  t.  If  kc(*)  is  an  co,t  function  such  that  for 
each  T  <  °°  and  for  (w,t)  £  ft  x  [0,T],  it  is  measurable  on  the 
product  a-algebra  .#[0,T]  *  &  (ne  (,s)  ,  s_<T)  ,  we  say 
that  ke  £  the  class  of  progressively  measurable  functions. 


*  j# [0 ,T]  is  the  Borel  algebra  over  [0,T]. 


-//- 


Let  k  and  kL  be  in  Sf L .  We  say  that  p-lim  kL  =  kL  iff 
sup^sup^E  |  k^  (t)  |  <  00  and  E  |  k^  (t) -kc  (t)  |  -*•  0  as  n  -*■  00  for  each 

t.  Let  1/'Z  C  ye  be  the  subclass  of  functions  k  such  that 

A  r-  _ 

suptE|k(t)|  <  00.  Let  0  denote  the  subset  of  of  p-right 

continuous  functions;  k  being  p-right  continuous  means  that 
k  £  i/  and  for  each  t,  E  |  k  (t+s )  -k  (t )  |  -*•  0  as  s  +  0 .  If  (some 
version  of)  p-lim^  [E^k0  (t+s)  -ke  (t)  ]  /s  exists  in  ,  it  is 
called  Ack£  and  we  say  that  ke  €  £^(Ae),  the  domain  of  the 

operator  Ac,  an  operator  which  is  analogous  to  the  weak  infin¬ 
itesimal  operator  of  a  Markov  semigroup.  If  kGE  ,  we  say 
that  p-lim  kE  3  0  if  sup  .  E  |  k.e  ( t )  |  <  °°  andE|kG(t)|  +  0  as 

o  f  u 

e  -*•  0  for  each  t.  The  functions  introduced  in  Theorem  2  and  it 

its  proof  have  progressively  measurable  versions. 

Kurtz's  semigroup  approximation  theorem  [IQ],  adapted  to 
our  purposes.  We  treat  t  as  a  component  of  the  state  vector, 
in  order  to  allow  us  to  work  with  nonstationary  cases.  The 
conditions  will  be  commented  on  below.  They  are  more  readily 
verifiable  than  may  be  apparent.  The  following  theorem  [3]  is 
the  basis  of  our  method. 

F  F  F  T*  +  T“  * 

Theorem  1.  Let  Z  (•)  =  (x  (*),y  (*))  be  a  sequence  of  R 
valued  right  continuous  processes,  x(*)  a  (Rr-valued)  Markov 
process  with  semigroup  T  ( •  )  mapping  into  and  which  is 


I 


► 

r 
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•  \ 

strongly  continuous  on  (sup  norm)  .  For  some  A  >  0  and 

A  A I 

set  ^  C  -Sf  (which  will  be  ’  ),  let  Range  ( A-A-3  /3  t 


2) 


dense 

be 


dense  in  where  A+3 /3 t  is  the  infinitesimal  operator  of  the 

process  (x(t),t).  Suppose  that,  for  each  f  E  ^ ,  there  is  a 
sequence  if6}  of  progressively  measurable  functions  such  that 
f£  E  £2(A  e)  and 


p-lim[fE(')-f(x£(*))]  =  0 

p-lim[ A  Efe (• )-  (A+3/3t)f (xe (•),•) ]  =  0. 

Then  if  xE(0)  ->  x(0)  in  distribution,  the  finite-dimensional 
distributions  of  xe ( • )  converge  to  those  of  x ( • )  (with  initial 
condition  x  ( 0 )  )  as  e  -*■  0. 

Remark .  There  is  a  similar  theorem  for  tightness  of  {xc(*)} 
which  is  particularly  useful  for  the  types  of  problems  encoun¬ 
tered  here.  In  fact,  if  the  finite-dimensional  distributions 
converge  (Theorem  1) ,  a  proof  of  tightness  under  reasonable 
conditions  is  not  hard.  See  [8,  Theorem  2,  Part  4] 
for  a  method. 

In  (2.4)  the  operator  (A+3/3t)  was  defined  on  a  set 
& q'  .  In  Theorem  1,  (A+3/3 1)  is  considered  on  a  dense  subset 

^  6  if  (which  ^q'3  is).  The  question  of  concern  is:  does 

this  restriction  of  (A+3/3 t)  define  the  infinitesimal  operator 

A 

of  a  Markov  semigroup  uniquely?  If  the  closure  in  *£  of  this 
restricted  operator  is  the  infinitesimal  operator  of  x(*),  then 


I 


A+3/9t  defines  TC)  uniquely.  Since  we  can  only  work  (in  the 

proofs)  with  nice  classes  such  as  &  -  and  not  with 

the  domain  of  the  infinitesimal  operator  of  x(-),  it  is  important 
to  know  if  &  is  big  enough  to  yield  the  limit  uniquely.  In 

fact,  the  condition  on  the  density  of  the  range  holds  in  all  the 
cases  of  Sections  4-6,  and  is  rather  unrestrictive . 

Due  to  lack  of  space,  it  is  not  possible  to  discuss  the 

relative  advantages  of  Theorem  1  (or  Theorems  2  and  3)  to  the 

approximation  problem  over  more  classical  semigroup  approxima¬ 
tion  methods.  It  seems  to  be  much  easier  to  use  in  the  usual 
problems  encountered  in  control  and  communication  theory 

dnd  the  relevant  proofs  (e.g.  that  of  Theorem  2)  are  shorter 
and  use  better  conditions. 

The  "density"  condition  together  with  the  condition  on 
strong  continuity  of  T(>)  can  be  eliminated  by  an  alternative 
approach  [11]  which  replaces  them  by  the  simple  assumption-  that 
to  the  coefficients  a(-,>)  and  b(*)  of  A  there  corresponds  a 


sense  of  distributions) .  This  condition  also  holds  in  our 
examples.  The  proof  of  the  theorem  corresponding  to  Theorem  2 
in  that  case  would  be  almost  the  same.  We  stick  to  an  approach 
based  on  Theorem  1  because  it  is  also  applicable  and  the  refer¬ 
ences  are  currently  available. 

We  next  give  some  specializations  of  the  theorem  suitable 
for  our  applications.  Theorem  1  is  given  in  the  general  form 


because  it  shows  how  to  modify  the  following  specializations  when 
variants  are  required  for  particular  cases.  We  omit  explicit 
discussion  of  tightness,  due  to  lack  of  space.  Our  conditions 
will  guarantee  the  tightness,  via  the  methods  of  proof  of  [8], 


3.  The  Main  Convergence  and  Approximation  Theorems 

We  start  with  the  form  (2.1)  and  bounded  y('),  because  it 
is  good  enough  for  many  applications  and  illustrates  the  tech¬ 
nique  with  the  least  notational  encumbrance.  Then  we  discuss 
the  case  where  y{>)  is  unbounded  and  F(x,y,t)/t  =  F(x,t)y/e. 
Finally,  we  remark  on  the  cases  (2 . 2 ) - (2 . 3),  which  actually  occur 
in  some  of  the  examples. 

Assumptions 

(Al )  F (•,*,* )  and  G (•,*,* )  are  continuous,  the  first 

(second,  resp.)  having  two  (one,  resp.)  continuous 
(in  x,  y,  t)  partial  x-derivatives . 

(A2 )  |F(x,y,t)|  +  |G(x,y,t)|  _<M(l+|x|),  for  some  constant 

M. 

(A3)  y(*)  is  stationary,  bounded,  right-continuous, 

EF (x , y (s ) , t )  =  0,  each  x,  t,  and  y(-)  is  strong 

mixing  in  the  sense  that  there  is  a  function  p ( • ) 

00  1/2 

satisfying  fp  '  (s)ds  <  °°  and 
0 
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sup  |  P  (  b!  A)  -P  (B)  |  p  (t)  , 

A,  B,  S 

A  <E  &  (y  (u)  ,  u  <s)  ,  BE  £  (y  (u)  ,u>s+t)  . 

(Such  a  condition  is  quite  common  in  the  literature 
on  applications  of  weak  convergence  theory.  It  is 
satisfied  by  truncated  Gaussian  processes  with 
finite  BW  and  continuous  spectrum,  by  bounded 
ergodic  Markov  chains,  etc.) 

A1  3 

(A4)  The  operator  A +3 /9 t  is  the  restriction  to  % q'  of 

the  infinitesimal  operator  of  a  strong  Markov  process 

A  A 

with  semigroup  T ( • )  mapping  %  into  %  and  being 

A 

strongly  continuous  on  . 

A 

(A5)  A+9/3t  on  its  domain  in  is  determined  by 

its  action  on 

Remark.  (A4)-(A5)  hold  in  our  cases  and  in  the  usual  situations 
which  arise  in  communication  theory.  They  pertain  only  to 
the  limit  x ( • ) ,  and  not  to  the  x° ( • ) .  Further  remarks  appear 
in  [8] .  See  also  the  comments  at  the  end  of  the  last  section 
concerning  simplifying  the  conditions. 

Theorem  2.  Let  xe(0)  -*■  x(0)  in  distribution.  Then,  under 
(A1)-(A5),  {xG  ( - )  }  converges  weakly  in  Dr[0,°°)  _to  x(*),  a 
diffusion  whose  infinitesimal  operator  (9 /9  t+A)  is  given  by 


(2.4),  and  v-’ith  initial  condition  x(0) 


Outline  of  proof.  The  proof  lor  the  non- time-dependent 
case  appears  in  [  8  1  •  and  is  a  direct  application  of  Theorem  1 
(for  convergence  of  finite-dimensional  distributions),  and 
another  result  of  [10]  for  tightness.  Given  f  £  q'  '  the 
main  object  is  to  get  the  sequence  {f£(-)}  of  Theorem  1,  and  to 
verify  the  p-lim  requirement  of  that  theorem.  We  outline  this 

because  various  extensions  of  the  method  are  needed  for  the 
examples,  and  it  is  useful  to  have  an  explicit  outline  for  the 
time-dependent  case,  since  the  forms  of  some  of  the  functions  are 
a  little  different.  Reference  [8]  dealt  only  with  G,  F  not 
depending  on  time,  but  the  time-dependent  case  is  important  for 
applications  and  requires  only  a  few  changes  from  the  treatment 
in  [8].  The  method  of  getting  the  fe(-)  is  adapted  from  the 
averaging  method  in  [7] . 

Let  f  £  ^0'  .  We  construct  fc  in  the  form  fe(t)  = 
f(xc(t),t)  +  £?_Qf^ (xc (t) ,t) .  Our  f£  is  the  cf£  of  [8],  our 
f0  is  split  off  from  the  e2f2  term  in  [  8  ]  and  our  f£  is  the 
remainder  of  the  e  term  there.  The  terms  are  modified  to 
account  for  the  explicit  t-dependence  of  f,  G,  F.  Note  that 
(use  x  =  xc(t)  for  notational  simplicity) 

(3.1)  Aef(x,t)  =  f  t ( x , t )  +  f^(x,t)  [G(x,ye(t) ,t)  +  F(x,ye(t) ,t)/e] . 
Define  f£(x,t)  by 
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fg(x,t)  =  EEf^ (x, t+s ) [G (x,yE (t+s ) , t+s ) -G (x, t+s) ] ds 
0 

oo 

=  e2  EEf ^  (x,  t+e2s )  [G  (x,y  (^jj+s )  ,  t+e2s ) -G  (x ,  t+e2s)  ] ds . 

0  E 

The  integral  exists  for  each  e  >  0  by  (Al) ,  (A3)  and  the  compact 

support  of  f.  Also  fE(t)  =  fQ(xe(t),t)  »  0(e2)  and  fE(-)  €  £2(AE). 
Again  use  x  =*  xE  (t)  .  Then 

(3.2)  AEfE(x,t)  =  -f^(x,t)G(x,yE (t) ,t)  +  f^  (x,  t)G  (x,  t)  +  0(e)  terms 

Note  that  Ae [f (x,t)+fE (x,t) )  =  ffc(x,t)  +  f  ^  (x ,  t )  G  (x,  t )  + 

0(e)  terms  +  f^ (x, t) F (x,y£ (t) , t) /e .  The  term  G(x,yE(t),t)  in 
(3.1)  has  thus  been  replaced  by  its  average  G(x,t)  modulo  an 
0(e)  term.  This  was  the  reason  for  the  addition  of  the  fE  term. 

A  similar  "averaging"  scheme  will  be  used  to  replace  the 
f^ (x, t) F (x,yE (t ) , t) /e  term  by  the  rest  of  Af  modulo  0(e).  This 
will  be  done  in  two  steps  by  using  the  f^  and  f^  defined  below. 
Proceeding,  define  f^  by 
00 

, 

(3.3)  fE(x,t)  =  —  Ee f ^ (x , t+s ) F (x, y E (t+s) , t+s ) ds 

0 

00 

=  e  EEf '  (x,t+e2s)F(x.y  ,t+e2s)ds  =  0(e). 

.  *-  x  p* 

0  E 

Furthermore,  f^(*,t)  is  differentiable  in  x  and  fE(*)  = 

fE(xE(-),*)  G  i3MAE)  and  (again,  setting  xE  (t)  »  x) 
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(3.4)  Aef^(x,t)  =  -f^(x,t)F(x,ye  (t) ,t)/e  +  0(e) 

+  (f^(x,t) )^[G(x,yc (t) ,t)  +  F(x,yc (t) ,t)/e] . 

It  can  be  shown  that  the  gradient  (f^)x  can  be  obtained  by  dif¬ 
ferentiating  with  respect  to  x  under  the  integral  in  (3.3).  Also 
(f*:  (x,  t)  )  .'.G  (x,ye  (t)  ,  t)  =  0(e),  and  we  ignore  this  component 

1  X 

henceforth.  The  fir3t  term  on  the  right-hand  side  of  (3.4) 
cancels  the  last  term  of  (3.1).  To  get  the  p-lim  result  required 
for  Theorem  1,  we  need  now  only  choose  f^  to  "cancel  the  effect 
of" 


(3.5)  (f  ^  (x,  t)  )^F(x,y£  (t)  ,  t)  /e  =  *  Je£  ( f  ^  (x ,  t+s )  F  (x ,  yL  (t+s )  ,  t+s )  )  xds 

0 

•  [ F  {x,^  (t>  ,  t )  ]  . 

e 

Define  A^f  (the  average  value  of  (3.5)  -  change  variables 
s/e2  -*■  s  and  use  the  stationarity  of  y  ( * )  )  by 


(3.6)  A^f (x,t) 


EF '  (x, y (0 ) ,  t )  [f^(x,t+e2s)F(x,y(s),t+e2s)  ]  x<ls . 

0 


(3.6)  exists  by  the  strong  mixing  (A3)  and  the  fact  that 
EF(x,y(s),t)  =0.  As  £  +  0,  (3.6)  converges  uniformly  in  x,  t 
to  the  integral  in  (2.4). 

Now,  define  f^tXjt)  by 


00 


CO 


f2(x,t)  = 


ds  { 

0  0 


du  E^F '  (x,yL  (t+s)  ,  t+s)  [f^(x,t+s+u) 

e 

•  F (x , y L (t+s+u) , t+s+u) ]  -  A^fCx/t+s)} 


00  00 


=  e 


ds  { 

0  0 


du  E^F'  (x,y (— rr+s ) , t+e2s)  [f '  (x, t+e2s+e2u) 

L  Z  X 

e 

F  (x,y  (^H-s+u)  ,  t+e2s+e2u)  ]  x  -  A^f  (x , t+e2s ) } 


0  ( e2 )  . 


2 

The  integral  exists  and  equals  0(e  )  by  the  centering  about 
the  mean  value  A^f,  the  strong  mixing  (A3)  and  the  compact  sup¬ 
port  of  f.  Now, 


p-lim  [f  q  +  f[  +  fp  =  0, 

p-lim  [Aef e  ( •  )  -  (9/3t+A)f(xe  (•),•)]  =  0, 


and  Theorem  1  yields  the  convergence  of  finite-dimensional  dis¬ 
tributions.  The  tightness  argument  is  the  same  as  that  in 
[  8  ,  Theorem  2] .  The  proof  concludes  by  noting  that  now  all 
the  conditions  of  Theorem  1  hold.  Q.E.D. 

Unbounded  y(’)  and  form  F(x,y,t)  =  F(x,t)y,  G(x,y,t)  = 
G(x,t)  +  GQ(x,t)y.  The  treatment  of  the  unbounded  (e.g. 

Gaussian  y(*))  case  is  similar  to  that  of  the  bounded  y(*)  case. 


(3.7)  du  -  Audt  +  Bdw, 

A  asymptotically  stable,  w(*)  =  Wiener  process.  Since  such 
processes  constitute  the  class  of  Gaussian  processes  with 
rational  spectral  densities,  (Al')-(A4')  are  certainly  not  res 


trictive. 


-*/- 
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In  [  8],  the  unbounded  y ( - )  case  was  treated  in  Theorem  5, 
and  GQy  was  not  explicitly  included.  The  proof  there  goes 
through  without  any  additional  conditions  or  difficulty  if  GQy 

is  added  -  provided  that  GQ  has  continuous  x-first  partial 
derivatives.  In  that  proof,  it  was  difficult  to  work  with  un¬ 
bounded  F,  G  when  y(*)  was  unbounded,  so  the  following  assump¬ 
tion  was  added  (adapted  to  our  case  here). 

( A6  * )  For  each  N ,  there  are  functions  FN ,  GN ,  G^  equal 
to  F,  G  and  GQ,  resp.  ,  in  SN  =  {x:  |x|_<N},  but 

bounded  and  smooth  (as  smooth  as  F,  G-,  GQ  are) 
out  of  SN,  and  such  that  (A4),  (A5)  hold. 

This  condition  is  normally  satisfied  and  holds  in  our  examples. 

The  reference  to  (A4) ,  (A5)  can  be  dropped  if  the  approach  in  [11] 
is  used  (it  is  then  replaced  by  uniqueness  of  the  solution  to  the 
I  to  representation  of  x ( • ) ) . 

Theorem  3.  Under  (A2),  (A1')-(A6'),  and  x£(0)  x(0)  in  distri¬ 

bution,  the  finite-dimensional  distributions  of  (xe(*)}  converge 
to  those  of  x ( • ) .  If  y(*)  is  given  by  a  linear  combination  of 
the  states  of  (3.6),  then  {xe  ( • )  >  is  also  tight  and  {xe(*)}  ■+ 
x(-)  weakly  in  Dr[0,°°). 

The  proof  is  similar  to  that  of  Theorem  2.  Given 
f  jf  q  ,  we  construct  f  as  in  Theorem  2  and  prove  the  p-lim 
requirements  of  Theorem  1.  See  [  8]  for  the  details  in  the 
non-time-varying  case. 


Extensions  to  (2.2),  (2.3).  When  F  and  G  depend  on  e, 

^1  3 

the  procedure  is  exactly  the  same.  Given  f  G  % ^  ,  we  construct 

fC  as  done  in  Theorem  2,  making  sure  that  the  integrals  are  well 

defined  and  of  the  proper  order  in  e,  and  replacing  G,  F  by  G^, 

F£.  We  need  EG£  (x  ,y  (0)  ,  t)  -►  G(x,t),  a  continuous  function, 

a  i  3 

uniformly  on  bounded  (x,t)  sets,  and  that  for  each  f  G  r  q'  , 


(3.8)  f  t  ( x ,  t )  +  G’  (x,t)  fx  (x,t)  + 


E  F^  (x,y  (0)  ,t) 


[f(x,t+e2s)F  (x,y(s) ,t+e2s) ]  ds 

X  w  X 


converges  uniformly  on  bounded  (x,t)  sets  to  ( 3/9t+A) f (x , t) . 

Example  (c)  (Section  6)  requires  a  slightly  different  exten¬ 
sion,  but  the  general  idea  is  the  same. 


4.  The  Phase  Locked  Loop 

The  standard  PLL  is  represented  in  Fig.  2  and  is  ,  perhaps,  the 
simplest  application  of  the  foregoing  ideas.  Via  suitable 
choices  of  D,  E,  C,  Q  all  the  usual  filters  can  be  constructed. 

We  first  do  the  case  D=E=C=0,  Q=l,  which  yields  the 
standard  form  of  the  first-order  loop  [ 12] •  The  general  case 
is  treated  in  precisely  the  same  way,  and  is  given  below.  The 
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function  0(*)  denotes  the  input  phase  process  and  9 ( * )  its 
estimate,  as  determined  by  the  loop.  Then 

(4.1)  0e=  |^[sin (0-0E)  +  sin  (0  +  0C+2o)ot)  ]  +  K  cos (wot+0L ) -nE (t) , 

and  we  have  the  situation  of  either  Theorem  2  or  3,  under  broad 
conditions  on  y(-),  where  we  use  nc  (t)  =  yE(t/c2)/e. 

The  usual  method  of  getting  an  approximate  diffusion  equa¬ 
tion  proceeds  roughly  as  follows  [12].  First  the  "double 
frequency"  terms  are  dropped  (which  is  justified  for  large 
irrespective  of  the  filter  form,  as  we  see  below) ,  then  it  is 
argued  that  since  0e(-)  varies  "much  more  slowly"  than  ne(*)»  one 
can  replace  n  (•)  by  a  white  Gaussian  noise  of  the  same  power 
per  unit  BW  that  nC(‘)  has  within  the  (say)  pass  band  of  the 
filter. 

Actually,  as  the  BW  of  nc(-)  increases  (justifying  in  a 

G  ^  £ 

sense  the  "increasing"  independence  of  n  (t)  and  0  (t)),  the 

magnitude  of  nE  ( • )  must  increase  (see  Section  2)  making  the 
replacement  of  nL(-)  by  white  Gaussian  noise  a  little  worrisome. 

In  fact,  it  was  a  "paradox"  arising  from  a  problem  of  this  sort 
which  apparently  led  to  the  original  Wong-Zakai  [  3  1  work. 

The  weak  convergence  method  of  Theorems  1-3  explicitly  yields 
the  correction  term,  and  ov,ing  to  the  nature  of  weak  converg¬ 
ence,  the  distributions  of  many  functionals  of  the  limit  are 
close  to  those  of  xc(*);  for  example,  first  exit  times  from 


-*$■- 
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aPPropriate  sets.  Indeed,  in  order  to  study  approximations  to 
path  properties  of  xe(-)  via  x(-),  weak  convergence  seems 
to  be  the  appropriate  technique.  This  is  an  important  advantage 
which  the  traditional  methods  do  not  have. 

Define 

oo 

R  =  Ey  (0 ) y  (s ) ds , 

0 

and,  in  order  to  fix  ideas,  let  y (•)  satisfy  the  noise  conditions 
of  either  Theorem  2  or  3.  All  the  other  conditions  of  these 
theorems  hold.  Then  the  theorems  yield  that  { 0 e ( - )  }  converges 
weakly  to  the  process  §(•)  with  the  operator  (3/3t+A)  given  by 

(4.2)  (3  /3  t+A)  f  (0  ,  t)  =  f  (0,t) 

a  A  V  A 

+  fx(e,t)  lysin<e_e)+sin(6+B+2w0t)  ] 

+  f  ~K2Rcos  (u)ot+0)  sin  (u>0 1+0  )  ] 

+  K2Rcos^  (ojpt+e  )  f^^  (6  ,  t)  . 

The  quantity  2R  is  roughly  the  power  per  unit  BW  of 
£  t 

n  (•)  for  small  e.  Also,  as  £  +  0,  / ne(s)ds  =  xe (t)  converges 

0 

to  a  Wiener  process  with  infinitesimal  covariance  2R.  To  see 

•  •  £  £ 

this,  set  x  =  n  ,  and  use  Theorem  2  or  3  as  appropriate,  to  get 
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that  xG  converges  weakly  to  a  process  x(*)  with  infinitesimal 
operator  (3  /3  t+R  32/3x2),  i.e.,  to  a  diffusion 

dx  =  / 2R  dw 

where  w(*)  is  a  standard  Wiener  process. 

From  the  form  of  (4.2)  it  is  easily  seen  that  the  limit 
©(•)  is  a  Markov  diffusion  with  an  It6  process  representation. 
In  particular,  there  is  a  standard  Wiener  process  B(-)  such 
that  §(•)  is  represented  by 

a  a  K  A  /v 

(4.3)  d0  =  ^[sin(0-0)+sin(e+0+2u)ot)]dt 

-  K2R  cos (wot+0) sin (wot+0)dt  +  K/2R  cos (wQt+0 ) dB . 

The  "correction"  term,  the  second  one  on  the  right,  is  not 
accounted  for  by  the  traditional  analysis,  and  arises  due  to  the 
non-independence  of  0e(t)  and  ne(t).  It  is  proportional  to  K2R. 
For  large  power/unit  BW  of  ne(*)/  or  large  system  gain,  this 
term  might  be  of  importance. 

The  general  rth-order  loop.  For  general  D,  E,  C,  Q  in  Fig. 
2,  0e  =  Cve  +  Q{r.h.s.  of  (4.1)}  and  the  limit  process 
(v (•),©(•))  is  representable  by  an  ltd  equation  of  the  form 

v  D  p 

(4.4)  d (  )  =  (  )v  dt  +  (“)[r.h.s.  of  (4.3)]. 

0  ^  u 
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The  result  for  the  general  filter  is  just  as  easy  to  get  as  the 
result  for  (4.1),  since  the  general  filter  only  affects  the  G 
term  (in  the  notation  of  (2.1). 

The  limit  as  -*■  °°.  We  consider  (4.3)  as  -*•  °°.  The 
same  result  holds  for  (4.4).  It  is  not  hard  to  see  that  the  two 
middle  terms  of  the  right  side  of  (4.3)  should  disappear  as 
cuq  but  it's  a  little  harder  to  see  what  to  do  about  the 

cos  (odgt+O )  coefficient  of  dB,  since  this  coefficient  depends  on 

0.  The  result  will  be  the  "traditional"  one,  but  it  is  often 
dangerous  to  use  heuristic  methods  to  treat  problems  involving 
"products  of  white  noise  and  state  variables".  Write  the  solu- 

A 

tion  to  (4.3)  as  0(<*>q,*).  We  have  the  following  theorem. 

Theorem  4.  {0(ii>Q,*)}  is  tight  in  D[0,°°)  and  as  a)Q  -*■  a>,  it 

converges  weakly  to  the  process  0  ( • )  given  by 

A  TVV  A 

(4.5)  d0  =  y^sin (0-0 )dt  +  K/R  dB. 

Proof.  Apply  Theorem  1  directly.  The  u )  indexes  the 

sequence  rather  thaft  e.  The  proof  will  be  outlined  only.  The 

9(<*>0,t) 

state  is  x  (t)  =  (  ).  All  functions  (cos  (u)Qt+0 )  , 

,  A 

sin  (wQt+0 ) ,  etc.)  in  (4.3)  are  Lipschitz  continuous  in  0,  uni¬ 
formly  in  t,  u)q,  and  all  are  bounded.  From  this,  we  can  easily 
show  that  there  is  a  constant  C  such  that 


E  |  0  (u)Q,t+s)-0  (u)Q,t)  |4  Cs2, 


all  t,  s,  coq. 
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By  [2,  Theorem  12.3],  this  implies  tightness  of  {6(Wg,*)}. 
Efc  denote  conditioning  on  B(s),  s  _<  t. 

Next,  fix  f  £  SfJ'3.  Define 

2ir/wr. 

a)  r  0 

f  °(t)  =  2?  I  Efcf  (§  (W(),t+s)  )  ,t)ds. 

0 

w0  -^0 

Then  it  can  be  shown  that  f  (•)  j  ^(A  )  and  that 


^“0  w0 

A  °f  U(t) 


Efcf  (0  (o>0. 


2tt/uj 


2tt 


t+2TT/u0)  ,t)-f  (0  (wq, t) 
2tt/coq 

0 

Efcft  (0  (o)Q,  t+s)  ,  t)ds. 


,t) 


Owing  to  the  uniform  Lipschitz  condition,  we  can  show  that 
p-lim[fa)°(*)  -  f  (0<uo,  •),*)]  =  0, 
p-lim[Aa)°fW°(*)  -  0/3t+A)f  (§(•),*)]  =  0, 


Let 


by  which  Theorem  1  guarantees  convergence  of  finite-dimensional 
distributions.  This,  together  with  the  tightness,  guarantees 
the  weak  convergence.  Q.E.D. 


5.  Adaptive  Antenna  Arrays  [13],  [14],  [15] 

For  another  illustration  of  the  general  idea,  we  consider  a 
standard  problem  in  adaptive  antenna  arrays.  Let  the  array  have 
r  elements  and  input  vector  z£(*)  =  {z j('),  j=l , ...,r},  where 
z£(-)  =  Sj(')  +  n  j  ( •  )  and  s^  (t)  =  A  cos  (o^tt^ )  ,  where  the  {<|>j} 
are  assumed  known  (known  signal  transmission  direction) .  The 
input  to  the  jth  (each  j)  antenna  is  split  into  two  parts,  one 
part  passing  through  an  ideal  tt/ 2  phase  lag  device  whose  output 
we  denote  by  z£(*)  =  n ^  ( - )  +  s^(-).  The  2r  outputs  are  weighted 
and  added  to  yield  the  "array  output"  X£(«). 

Define  Z  =  (z^,...»z^,  ,  z^,...»z^_)  =  (z  ,  z  ),  denote 
the  respective  weights  by  W  =  (w^,...,wr,  w^,...,wr)  = 

(w,w)  and  set  S  =  (s^,...,sr,  s^,...,§r)  =  (s,s). 

The  object  is  to  adaptively  adjust  W  in  order  to  adaptively 
maximize  the  signal- to-noise  power  ratio  in  the  output 
XC  =  w’z£  +  w’z£  =  W’Z£.  Again,  for  convenience,  suppose  that 

c  O  c  — 

the  noise  takes  the  form  n  (t)  =  y(t/e  )/e  =  y  (t)/e.  Let 
and  Me  denote  the  covariance  matrices  of  the  vectors  (y(0),y(0)) 
and  (n£(0),  n£(0)),  resp.  Then  Me  =  M^/e2.  Assume  that  Mg  >  0 
(in  the  sense  of  positive  definite  matrices).  Then  the  optimum 
weight  vector  equals  WQ  =  kMg^Sg,  where  k  /  0  is  any  constant 
and  SQ  =  {cos^j,  j^r,  sin4>..,  j<r}.  Define  Ye  ( * )  =  (ye  ( • )  ,ye  ( * ) ) , 
Y(-)  =  (y (• ) ,y (* ) )  and  M£(-)  =  Z£(*)  (Ze  (•))'•  Then  M£ ( • )  E 

S  9 

EM^ (• )  =  Mn/e2  +  (s  (• ) ,s  (• ) )  (s  (• ) ,s  (• ) ) ' .  The  scheme  of  Fig. 
s  u 

3  is  a  standard  method  [14],  [15]  of  adaptively  approximating 


-J/- 
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the  optimum  WQ .  The  describing  ordinary  differential  equation 
is  (gf  gg  are  positive  constants,  t  is  a  time  constant)  (5.1), 
where  W(‘)  should,  we  hope,  converge  to  something  "close  to"  WQ . 

(5.1)  tWc  =  -WE  -  gZeXc  +  gQS0 

=  -  [gZe  ( Ze  )  '+I]We  +  gQS0 

A  standard  method  of  treating  (5.1)  (see,  e.g.  [14]  -  or 
papers  in  [  15]  )  involves  first  dropping  the  signal  component 
of  Ze,  and  then  arguing  as  follows:  since  Me(*)  is  wide-band 
and  W£ ( • )  is  much  smoother  than  Me(«),  the  two  are  essentially 
independent,  so  assume  this,  take  expectations  in  (5.1)  and 
replace  (5.1)  by  the  resulting  equation  (5.2),  which  ought  to 
be  (approximately)  an  equation  for  the  mean  value  We ( • )  of 
We  ( • )  . 

(5.2)  iwe  =  -[gMSijw8  +  gQs0. 

—  £  — 8  - 1  ,  , 

The  asymptotic  solution  to  (5.2)  is  W  ■  g^ [gM  +1]  Sq  which  is 
close  to  the  optimal  value  if  g  is  large. 

Prom  a  mathematical  point  of  view,  there  are  some  difficul¬ 
ties  with  this  line  of  reasoning  -  even  allowing  for  the 
usually  justifiable  neglect  of  the  s(*)  terms  in  (5.1).  As  the 
BW  of  nc(*)  increases,  thereby  "justifying”  the  "almost  indepen¬ 
dence"  assertion,  the  covariance  Me  must  also  increase  (see 


-JS- 
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Section  2),  so  it's  not  inunediately  clear  what  one  can  say  about 
the  expectation  of  the  product  of  MG(t)  and  WG (t) .  To  see  the 
problem  more  clearly,  consider  the  scalar  case  tWg  = 

-(g(n  (s) )2+l)We+g0S0  (where  we  set  s ( * )  **  0),  solve  it  and  take 
expectations  to  get 

t 

(5.3)  EWe(t)  -  E (exp  -  (g(nG (s) ) 2+l)ds)WG (0) 

0 

t  t 

+  d*  E  exp  -  | [g(n£ (u) ) 2+l]du  g0SQ, 

0  8 

which  can  differ  considerably  from  the  solution  to  (5.2)  for 
small  e. 

We  now  set  the  problem  up  in  a  way  that  admits  an  asymptotic 

analysis  (as  e  ->■  0)  .  Clearly  g  must  be  inversely  proportional 

to  e  for  otherwise  Me  -*■  °°  as  e  -*■  0.  Suppose  that  an  automatic 

gain  control  mechanism  of  some  sort  is  available  and  that  we 

obtain  an  estimate  of  the  power  in  M£ ( • ) ,  which  is  proportional 
2  2 

to  1/e  .  Thus,  let  g  =  e  K  for  some  K  >  0.  Write  6MG(.)  = 

MG(’)  -  EMG(-),  and  rewrite  (5.1)  in  the  form 

(5.4)  tWe  =  -le2KMge+  e2K6MsG  +  I]We  +  g0SQ 

=  -[KM0  +  e2KSS '  +  eK(YeS’+SYG' )  +  K (YEYe ' -MQ) +1 ) We 

+  goV 


Define  6MQ ( * )  =  Y ( * ) Y ' ( • ) -MQ  and  6M^(t)  =  6M0(t/e2).  Then 
it  is  clear  that  ( Y  ( •  )  »  <$MQ  ( • )  )  plays  the  role  of  the  noise 
y(-)  of  Theorems  2  and  3.  Theorem  3  (extended  to  (2.2))  is  applic¬ 
able  here,  and  its  conditions  are  better  than  those  of  Theorem  1 
for  this  case.  If  y(*)  satisfies  the  (reasonable)  conditions  of 
Theorem  3,  then  {W£(*)}  is  tight  and  converges  weakly  to  the 
solution  of  (5.5)  as  e  ■+•  0.  Here  the  limiting  diffusion  is 
degenerate  because  there  are  no  1/e  terms  in  (5.4): 

(5.5)  TW  =  -[KMQ+I]W  +  g0SQ. 

This  type  of  argument,  with  the  appropriate  scaling  of  g,  justi¬ 
fies  the  end  result  of  the  traditional  treatment,  namely  going 
from  (5.1)  to  (5.2).  Note  that,  owing  to  the  degeneracy  (the 
limit  is  an  ODE) ,  the  f^  component  of  fe  in  the  proofs  of 
Theorems  2  or  3  is  not  needed,  and  owing  to  this  (A3')  can  be 
dropped  from  Theorem  3.  We  note  in  passing  that  the  scaling 
g  -*•  (K/average  power)  is  often  used  in  practice  due  to  "dynamic 
range"  considerations.  So  our  scaling  conforms  with  practice  - 
even  if  this  particular  practice  i9  not  traditionally  used  in 
the  development  of  (5.5),  it  is  actually  required  for  its  jus¬ 
tification. 

First-order  noise  effects.  The  system  (5.4)  can  readily 
be  centered  and  scaled  in  order  to  get  the  first-order  noise 
effect.  Define  UC(-)  =  [Wc ( • ) -  W(  • )  ]  /  e.  A  comparison  of  the 


following  development  with  that,  say,  in  [14]  reveals  some  of 
the  mathematical  shortcomings  of  the  usual,  more  heuristic 
approach.  Then 

(5.6)  U£  =  -K[M0+I]UL  -K(YeS'+SYC' )WL-  KeSS’W C 

-  K(6Mq/g)W  -  K6MqUC,  Ue  (0)  =  0. 

Theorems  2  or  3  can  be  applied  and,  again,  (Y ( • ) , 6MQ  ( • ) )  plays 
the  role  of  y(«)  in  those  theorems.  If  (Y ( • ) , 6MQ ( • ) )  satisfies 
the  conditions  on  the  y(-)  of  those  theorems,  then  {Uc  ( - )  }  is 
tight  and  converges  weakly  to  a  process  U ( - )  with  the  Ito  equa¬ 
tion  representation 

(5.7)  xdU  =  -K [Mq+I] Udt  +  K  dB, 

where  B(*)  is  a  non-standard  Wiener  process  whose  covariance  can 
be  obtained  from  the  {a^}  in  the  operator  A  in  (2.4)  in  the 
following  way. 

The  operator  (A+3/3t)  is  given  by 
(A+a/3  t)f  (U,t)  =  ft(U,t)  +  f;<U,t)[-K(M0+I)U]  +  (5.8),  where 

(5.8)  is  the  integral  term  in  (2.4)  (note  W(-)  is  not  random) 


-35" 
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(5.8)  K  |  EW ' ( t ) 6Mq ( 0 ) f uu (U,t)6MQ(s)W(t)ds 


=  K  trace  f  (U 
uu 


,t)  J  ESM0(0)W(t)w'(t)  6Mq  (s ) ds 


-L  /  J  1  J 


(If  the  {a^j (t) }  in  (5.8)  is  not  symmetric,  then  symmetrize  it 
so  that  a^j  =  a^.)  The  "infinitesimal"  covariance  of 
B (t+dt) -B (t)  is  {a^j(t)}dt.  Then,  to  first-order  terms  and  with 
wide-band  input  noises,  W(t)  =  W(t)  +  eU  (t)  .  Note  that  the  limit 
equation  (5.7)  does  not  have  a  "correction"  term  since  the  1/e 
term  in  (5.6)  does  not  involve  UG.  The  lack  of  a  "correction" 
term  is  not  a  priori  obvious,  however. 


Convergence  on  [O,00).  Normally,  the  part  of  UG(*)  that  is 
of  most  interest  is  the  "tail".  We  would  like  to  know,  for  example, 
that  the  distributions  of  UG (t)  ,  t  >  T,  are  close  to  those  of 
the  stationary  solution  to  (5.7)  for  small  e  and  large  enough 
T.  Weak  convergence  does  not  quite  give  this  type  of  result. 
However,  in  this  case,  a  useful  result  is  not  hard  to  get.  We 
only  state  it  -  the  details  of  proof  of  a  similar  case  are  in 
I  9  ]. 

If  y(*)  is  a  bounded  process,  then  it  can  be  shown  that 


(5.9) 


sup  E|Ue(t)|2  is  bounded 

t>0,  c  small 


and 


£4 
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(5.10)  {Ue(T+-),  T>0,  e  small)  is  tight  in  D  [0,°°) 

and  U£(T+*)  tends  weakly  to  the  stationary  solution  to  (5.7),  as 
e  -*•  0  and  T  ■*  °°  in  any  way  at  all. 

A  proof  is  in  [  9  ] .  (5.8)  is  the  key,  and  is  obtained 

via  a  Liapunov  function  stability  analysis  of  (5.6). 

The  Liapunov  function  Ve  (*),  constructed  according  to  a 
method  in  (  9  ] ,  has  the  form  Ve ( • )  =  V ( • )  +  V£ ( • )  +  V£ ( • )  + 

V£(*)/  where  V ( * )  is  a  Liapunov  function  for  the  deterministic 
system  (5.5)  and  Ve  is  a  perturbation  calculated  from  V(-)  more 
or  less  the  way  fe  was  calculated  from  f(*)  in  Theorem  2.  The 
ability  to  obtain  results  of  the  type  stated  in  (5.8)  and  below 
it  is  a  very  useful  byproduct  of  the  method  discussed  in  this 
paper.  In  fact,  the  traditional  method  of  analysis  of  this  prob¬ 
lem  assumes  some  sort  of  asymptotic  stationarity  and  stability  [14] . 


6.  Filtered  Hard  Limited  Signal  Pius  Noise 

For  the  final  example,  we  consider  the  case  of  a  continuous 
signal  plus  noise  s (t)  +  nC  (t)  passing  through  a  hard  limiter 
(level  L)  followed  by  a  band  pass  filter.  In  a  classical  paper, 
Davenport  [16]  treated  a  form  of  this  problem  where  s(t)  = 

A  cos  pt  and  nc(*)  had  total  power  N  and  a  spectrum  in  a  fixed 
band  centered  around  p.  He  obtained  specific  values  for  the 


ratio 


(6.1)  [(signal  power)  /Noise  power  out  -  /[(signal  power).  /N] 

in  a  band  around  p  in 

in  the  two  limiting  cases  of  N  +  0  and  N 

Our  assumptions  are  a  little  different.  Here,  s(*)  is  an 
arbitrary  continuous  function.  Again  we  set  ne(t)  =  y(t/e2)/e 
and  pass  s(t)  +  nc(t)  through  a  hard  limiter  and  then  through 
any  filter  which  has  a  linear  differential  (or  even  nonlinear, 
if  we  wish)  equation  representation.  The  stochastic  differen¬ 
tial  equation  which  represents  the  output  is  derived,  and  from 
it  we  can  readily  obtain  a  limit  value  for  an  input-output  ratio 
similar  to  (6.1).  In  order  to  keep  the  notation  simple,  we 
first  suppose  that  the  limiter  is  followed  only  by  an  integrator. 
As  in  Section  4,  the  general  case  is  handled  in  exactly  the 
same  way;  the  form  of  the  filter  does  not  affect  the  method. 

The  output  x  (•)  is  given  by 

xe  =  Kesign(s(t)  +  nc(t)), 

where  is  a  scale  factor  whose  value  will  not  affect  the  power 
ratios.  In  fact,  it  is  convenient  to  use  K£  =  L/e,  which  we  will 
do.  Then 


(6.2)  xe  =  ^  sign[s(t)  +  y(t/e2)/e]. 


I 


Although  (6.2)  differs  from  the  forms  (2.1)  used  in  Theorems  2 
and  3,  particularly  because  of  the  1/e  factor  appearing  both 
inside  and  outside  the  sign  function,  Theorem  1  can  still  be 
used  and  a  proof  similar  to  that  of  Theorem  2  gets  the  correct 
limit  and  the  construction  of  the  fc(*).  We  go  through  some 
of  the  details  below,  in  order  to  illustrate  the  versatility 
and  robustness  of  the  technique.  Since  x'  (•)  is  not  involved 
as  either  an  argument  or  coefficient  of  the  sign  function,  the 
scheme  is  not  hard  to  use. 

To  facilitate  computation,  we  let  y(-)  be  Gaussian  with 

2 

correlation  function  o  exp  -a|t|  (a  >  0).  It  will  be  shown 
that  as  z  ■*  0,  xe(*)  converges  weakly  to  a  process  x(*)  which 
has  the  ltd  representation 

(6.3)  dx  =  h/27^  (Hi^l)dt  +  W  2  In  2/a  dB. 

If  a  filter  of  the  form  used  in  Fig.  2  follows  the  limiter  (where 
we  set  Q  =  0  to  avoid  white  noise  in  the  output) ,  then  the  limit 
equation  is 

(6.4)  dv  =  Dvdt  +  Edx 

z(t)  =  output  =  Cv(t). 


i 
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Input-output  signal- to-noise  ratios.  The  integrated  input 

noise  /  n  (s)ds  converges  weakly  (as  in  Section  4)  to  a  Wiener 

0  2 
process  whose  covariance  at  time  t  is  2o  t/a  =  2Rt  = 

oo 

2t  /  Ey(0)y(s)ds.  Thus,  as  c  ■*  0,  the  input  power  per  unit  BW 

0  2 
(in  any  finite  frequency  range)  converges  to  2a  /a.  In  order  to 

get  a  concrete  power  ratio  comparison  with  Davenport's  result, 
set  s(t)  =  A  cos  pt  here  only.  Consider  the  form  (6.4)  where  the 
system  dimension  and  D,  E,  C  are  chosen  to  get  a  good  approxima¬ 
tion  to  a  zonal  filter  whose  pass  band  includes  p.  The  filter 
gain  is  unimportant  since  it  does  not  affect  the  ratio,  so  we 

assume  that  it  is  unity  in  the  pass  band.  The  noise  power  per 

2 

unit  BW  in  dx/dt  is  L  2  In  2/a,  the  limit  output  noise  power 

2  2  2 

per  unit  BW  as  e  +  0.  The  signal  power  in  dx/dt  is  L  A  /tto  , 
the  limit  output  signal  power  as  e  -*■  0 .  Thus 


(Signal/Noiae  power  per  unit  BW)  ./(Signal/Noise  power  per 

unit  BW) . 

in 

(6.5)  =  (^>/(— I  -?■  2>/(|^r>  -  T-nrr  ' 

ira 

which  is  slightly  greater  than  Davenport's  [1]  limit  ratio  (as 
his  N  -*■  °°)  of  it/4  . 

This  closeness  of  the  two  results  is  very  pleasing.  Since 
our  assumptions  are  different,  it  suggests  that  our  scheme  might 
yield  results  that  are  meaningful  under  other  circumstances 
where  similar  averaging  phenomena  occur.  In  our  case,  the 


input  noise  energy  per  unit  BW  is  held  constant  and  the  BW 
increased.  In  [16],  the  BW  is  held  fixed  and  the  power  per  unit 


BW  °°  (to  get  the  tt/4  limit  ratio).  In  [16  1  »  the  ratio  seems 
to  decrease  as  the  input  noise  power  increases,  which  is  consis¬ 
tent  with  our  result  2/n  In  2  >  tt/4,  since  our  power/unit  BW  is 
held  fixed.  The  "averaging"  phenomena  in  both  cases  are 
similar  -  in  that  the  existence  of  the  limit  makes  implicit  use 
of  the  "wild"  fluctuations  and  "large"  magnitude  of  the  noise. 

In  a  promising  study  currently  under  way,  a  phase-locked 
loop  with  a  saturator  like  non-linearity  is  being  studied  and 
compared  (favorably)  to  the  more  standard  systems.  Asymptotic 
methods  such  as  described  here  are  used.  They  seem  to  be  the 
only  available  tool. 

Now  an  outline  of  the  proof  that  the  xe ( • )  of  (6.2) 
converges  weakly  to  the  solution  of  (6.3)  will  be  given.  The 
proof  for  the  general  filter  case  with  limit  (6.4)  is  about  the 
same . 

Theorem  4.  Let  s ( • )  be  continuous,  y ( • )  Gaussian  with  covariance 
2 

a  exp  —  a | t |  and  mean  zero.  Then  {x°(*)}  is  tight  and  as  e  ■>  0, 
converges  weakly  to  the  process  x(-)  given  by  (6.3)  (integrator 
only  used)  or  (6.4)  (general  filter  following  the  limiter  used) . 

Proof .  We  stick  to  the  integrator  case.  The  general  case 
requires  only  carrying  an  extra  "drift"  term,  and  is  done  in 
exactly  the  same  way.  denotes  conditioning  on  y(u/e^),  u  <  t. 


-V/- 

-•m- 
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Part  1.  Set  y(*)  =  oz  (•),  where  z  (•)  has  correlation 
exp  -a|tj  and  let  z  denote  a  random  variable  with  the  normal 
N(0,1)  distribution.  The  factor  L  is  unimportant,  so  set  L  =  1 
here.  We  evaluate  Ge(s)  =  E  sign  (s+az/e ) .  Then 

(6.6)  G£(s)  =  [erf(J-)  -  erf  (=§£)]  =  S2j^  |  e  +  o(e) 

where  o(e)/c  -*  0  as  e  -*■  0,  uniformly  in  s  in  any  bounded  set. 
Define  G(s)  =  /2/i  (s/a)  and 


F£  (s,  aza  (t) /e)  =  [sign  (s+aza  (t) /£)  -  E  sign  (s+aza  (t ) /£)  ] 

=  [sign  (s+oza(t)/e)  ]  -  G£(s)  . 

Then 

(6.7)  x£  =  /2/^  ?  +  0(e)  +  Fc  (s,»za(t/e2)/e)/e. 

Some  details  will  be  omitted.  We  note  that  F£(s,t)  is  p-right 
continuous  since  y(-)  has  a  continuous  density  -  similarly  with 
other  functions  below  for  which  this  property  is  needed. 

The  aim  now  is  to  apply  Theorem  1.  Given  f  £  5^'^, 

{f  }  must  be  found  such  that  the  "p-lim"  requirements  of 
Theorem  1  hold.  The  other  conditions  of  Theorem  1  are  satis¬ 
fied,  where  A  is  the  operator  of  the  process  (6.3).  The  method 
of  proof  of  Theorem  2  will  be  used  to  get  both  {fe}  and  A. 


-V.2- 


-w- 


Similarly  to  the  situation  in  Theorem  2,  f  will  have  the 
form  (no  fg  is  needed,  since  is  not  random)  f  (t)  = 
-f(xe(t))  +  fj(xc(t),t)  +  (*E (t) , t) .  Setting  x  =  xr(t), 

s  =  s(t)  for  notaticnal  simplicity,  we  have 


(6.8)  Aef(x,t)  =  f  t  ( x ,  t )  +  fx(x,t)  [Ge  (s)  i  F£  (o,  oz3*.  t/t2)/e)/e] 


Define 


fj(x,t)  =  7  |fx(x,t+u)E®FE(s(t+u) 


,  cza (^j^)/e)du 
e 


=  e 


f  (x,  t+ezu)  E^F  (s  (t+e^u)  ,cza  (^+u)/e)du 

X  u  t.  _  ^ 


Owing  to  the  fact  that  F^  is  "centered"  about  its  expectation 
and  to  the  exponential  correlation  of  y(-)»  the  integral  exists 
and  p-lim  f1  »  0.  Also,  f ^  G  9)  (Ae)  and  (,Jse  x  =  xe(t)) 


(6.9)  Aef^(t)  =  -fx(x,t)F£  (S(t) ,  aza(t/e2)/e)/e 

00 

+  -  [  f  (x,t+u)E^F  (s(t+u),oza(±4^)/e)du 

C  i  XX  t  c  _  •• 

o 

F  (s (t ) ,cza(t/e2)/e) 

'  l— - - -  +  G£  (s  (t)  )  ]  . 


Calculations  such  as  showing  f^  €  ^(Ae)  are  not  hard  here, 

since  F^  does  not  depend  on  the  state  x.  Similarly  for  fE  below. 

The  first  term  of  (6.9)  cancels  the  last  term  of  (6.8) 

(which,  of  course,  is  the  reason  for  introducing  f^(*)).  The 

2 

integral  of  (6.9)  exists  and  equals  (change  variables  u/e  -*■  u) 

CO 

(6.10)  [f  (x,  t+t2u)E^F  (s  (t+e2u)  ,oza  (^+u)/e)du 

J  A  A  U  c, 

0 

•  (Fc(s(t),oza  (t/E2)/e)+eGe(s(t))]. 

The  cG^  term  goes  to  zero  in  the  p-lim  sense  as  s  -*•  0,  and  it 
is  ignored  henceforth.  Still  following  the  method  of  Theorem 
2,  let  Aq f ( x , t )  denote  the  expectation  of  (6.10)  (minus  tne 
eG^  term).  Then  (the  integral  exists,  again  by  the  centering 
of  F  and  the  exponential  decrease  in  the  correlation  function 
of  y  ( • ) ) 

00 

A5f(x,t)  =  If  (x,t+e2u)EF(o(tfe2u),aza(u)/c)F(s(t),cza(0)/E)du. 
U  j  xx  e  ^ 

0 

Next,  define  ,  the  "centered  and  averaged"  last  term  of 

(6.9)  (minus  the  G£  term),  by 

00 

oo  » 

f  2  (x,  t)  “  d  v{  jdv  [fxx  (x,  t+u+v)ijE^F£  (s  (t+u+v)  ,fza  (^p^) ) 

0  0  E  e 

F£  (s(t+v)  ,fza(^)  )]-  AEf(x,t+v)}. 


It  can  be  shown  that  the  integral  exists,  again  by  use  of 

the  centering  about  the  mean  A£f(x,t+v)  of  the  inner  integral, 

and  the  exponential  correlation  function.  Via  the  usual 

2  2 

change  cf  variables  u/c  -*•  u,  v/e  -*•  v,  we  get  that 
E  j  f 2 (x, t )  |  =  0(c2).  Also  f£  E  ^ (A£ )  and  (use  x  =  xc (t)  ) 

Ae f 2 (x , t )  =  minus  first  term  on  r.h.s.  of  (6.10)  plus 
terms  whose  (absolute)  expectation  is  0(c) 

+  A^f (x, t ) . 


Then,  concluding, 

(6.11)  p-lim[fc (• )  -  f(x£(-),*)]  =  0, 


p-lim[Aefe (*)-ft(xe (•) ,-)-A£f (xe(-) , *)-Ge(s)fx  (x£(-) , •) ]  =  0. 


In  Parts  2  and  3  below,  it  is  shown  that  ((6.12)  defines  Ag) 
f  (x,t)ln  2 

(6.12)  Agf(x,t)  -+■  — — - — -  =  Agf(x,t)  uniformly  in  x  for  each  t. 


This,  together  with  G^ts)  -*•  G(s)  uniformly  on  bounded  s-sets 

t 

(hence  f^G^  -*■  fxG  uniformly  in  x  for  each  t)  yields  the  theorem,  since 
the  process  of  (6.3)  is  the  unique  process  corresponding  to  the 
operator 


O/at  +  A)  =*  (|t  +  G(s(t))|^  +  j(- 


ln  2) 


3x 


a 


Let  z(*)  denote  a  Gaussian 


Part  2.  Evaluation  of  Agf. 
process  with  correlation  function  exp  -|t|.  Changing  variables 
u(old)  -*•  u(new)/a  yields 

A^f  (x,t)  =  |Iq (x,t) 

where 


(6.13)  ljj(x,t) 


*  j  fXx(x't+e2u/a) (Q£(t,u)  -  Re(t,u)]du, 


where 


Qc (t,u)  =  E  sign [s (t+ue2/a)  +  7-  z(u)]  sign  Is (t)+-z  (0) ]  , 

e  e 

2 

Rc(t,u)  =  E  sign  [s  (t+^~ — ) +— z  (u)  ]  E  sign [s  (t) +— z  (0) ] . 

a  c.  e 


Owing  to  the  properties  of  the  joint  distribution  of 
T 

(z(0),z(u)),  / | Qe (t ,u) -R£ (t,u) |du  is  bounded  uniformly  in  e  and 
0 

T  and  converges  uniformly  in  e  as  T  (In  fact,  the  integrand 

goes  to  zero  at  an  exponential  rate  as  u  °°.)  Using  this  and 

the  smoothness  and  compact  support  of  f  (*,•),  we  can  replace 
2  £ 

fxx(x,t+£  u/a)  by  £  (x,t)  in  Ig(x,t)  without  altering  the  limit 

as  e  -*•  0. 

By  the  above  arguments,  if  Qc (t ,u) -Rc (t ,u)  has  a  limit  for 
each  t  as  e  ■+•  0 ,  then 


(6.14)  lim  Aq  f (x,t) 

£-*•0 


i 

a 


lim[Qe (t,u)-Re (t,u) ]du 
£-0 


0 


and  also  that  in  order  to  show  (6.12)  it  is  enough  to  show  that 
the  integral  on  the  right  equals  In  2.  First  the  existence  of 
the  limit  will  be  shown.  Let  s+  =  s(t+e2u/a)  and  s  =  s(t). 

Then 

Qe(t,u)  =  P  (z (u) >-s+e/o,z (0) >-ss/o  }+P {z (u)  <-s+e/c,z (0) <-se/a} 
-P{z  (u)  >-s'1  e/c ,  z  (0)  <-se/a }-P  {z  (u)  <-s+e/o,z  (0)  >-se/o} 

Re (t ,u)  ■  [P{z (u) >-s+e/a}-P{z (u) <-s+E/a}] 

•  [P{z (0) >-se/a}-P{z (0) <-se/a}] . 

Obviously  as  e  -*■  0 ,  Re(t,u)  -►  0  (even  uniformly  on  bounded 
a,  s+#  t,  u  sets,  although  we  don't  need  this).  Also  (even 
uniformly  as  above) 

Q£(t,u)  -*•  P{z  (u)  >0  ,z  (0)  >0  }  +  P(z  (u)  <0  ,z  (0)  <0  } 

-  P(z (u) <0,z (0) >0}  -  P(z (u) >0,z (0) <0} 

=  2 [P{z (u) >0,z (0) >0}  -  P{z (u) <0,z (0) >0}] 

=  2J(u)  . 


Define 


I 


-*7- 


Jo  = 


J  (u) du. 


Then  we  have  proved  that 


Ajf(x,t)  ^  xx 


2f„~(x/t) 


J  (u) du  =  Agf (x,t) 


uniformly  in  x  for  each  t.  We  need  only  evaluate  J 


Part  3.  Proof  that  JQ  *  (In  2)/2.  Use  polar  coordinates 


and  write  p  =  e  u.  Then  the  joint  density  of  (z(0),z(u))  is 


(l-p‘) 


1  1 ' 172  (exp  -r2g(0)/2)r» 


g(9)  =  — ^-~[cos‘‘0-2psin0cos0+sin20  ]  =  — 1  -=-[l-psin20  ] 
1-P  1-P 


Also 


V2  00  0 

J+(u)  £  P{z  (0)  >0,z  (u)  >0}  =  [  f  - yt0  ( exp  -5— g  (0 )  )  drdQ 

q  q  2Tr(l-p2)i/^  2 

0  00 


J"(u)  =  P{z  (0)  >0/Z  (u)  <0 }  =  j  j  - r 2  ^  (exp-^-g  (9)  )drd6. 

-tt/2  0  2it(1_p  > 


Integrating  with  respect  to  r  yields 


h : 2 

'  lu)  -  I : 


d0  (1-p2) 


- ->  ~T7T 

2n  ( 1-P~)  ^  (l-psin20) 


1 

2 


r  d  e  (1-P2)1/2 

l-psin©  2* 


1 


By  [16,  eqn.  298  ], 


1  1  f 


m pp 


32' 


J±(u)  =*  |  tan'1  —g— r/2  -  ^  tan  1  - ±6-™- 

W  (l-p2)1/2  71  (1-p2) 1/2 


Using  tan_1x  -  tan_1y  =  tan"1 (x-y) / (1+xy)  [17,  p.  48],  we  have 


J+(u) 


-  J  (u) 


-P 


" " tan’1  Ti-pV'2 '  *  tan'1  (i-p^i/2 


i  _i  _ _ P _ 

+  =■  tan  ,2.1/2 


IT 


=  J-  tan'1 


(1-P) 
P 


(1-p") 


2J7I 


1  .  -1 
=  —  sin  p. 


Next,  let  p  =  e-t  and  change  variables  v  =  e  fc,  t  =  -In  v,  to 
get 


Jo  -  ¥  I  sin'1<e't)dt  -  ?  . 


r  sin  v 


dv 


With  v  =  sin  w  D.7  ,  p.  417]  , 


n/2 


Jq  =  i  |  w.ctn  w  dw  =  j  In  2.  Q.E.D. 
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